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Abstract 

Adaptation of asexual populations is driven by beneficial mutations and there¬ 
fore the dynamics of this process, besides other factors, depend on the dis¬ 
tribution of beneficial fitness effects. It is known that on uncorrelated fitness 
landscapes, this distribution can only be of three types: truncated, exponential 
and power law. We performed extensive stochastic simulations to study the 
adaptation dynamics on rugged fitness landscapes, and identified two quantities 
that can be used to distinguish the underlying distribution of beneficial fitness 
effects. The first quantity studied here is the fitness difference between succes¬ 
sive mutations that spread in the population, which is found to decrease in the 
case of truncated distributions, remain nearly a constant for exponentially de¬ 
caying distributions and increase when the fitness distribution decays as a power 
law. The second quantity of interest, namely, the rate of change of fitness with 
time also shows quantitatively different behaviour for different beneficial fitness 
distributions. The patterns displayed by the two aforementioned quantities are 
found to hold for both low and high mutation rates. We discuss how these pat¬ 
terns can be exploited to determine the distribution of beneficial fitness effects 
in microbial experiments. 

Introduction 

Microbial populations have to constantly adapt in order to survive in a chang¬ 
ing environment. For example, a bacterial population exposed to a new an¬ 
tibiotic must evolve in order to exist pQ. In asexual populations, this process 
of adaptation is driven only by rare beneficial mutations [2] which provide fit¬ 
ness advantage. So in order to survive in new environment, enough beneficial 
mutations should be available and the beneficial mutations should confer suffi¬ 
cient fitness advantage. While the first factor depends on the mutation rate and 
population size, the second factor is determined by the underlying fitness dis¬ 
tributions. Even though we have some understanding about the mutation rate 
of different microbial populations, the full fitness distribution is more complex 
and relatively little is known about it. But for moderately adapted populations 
(i.e. fitness of the wild type is high enough), rare beneficial mutations which 
occur in the tail of the fitness distribution can be described by the extreme 
value theory (EVT) as proposed first by Gillespie [5]. The EVT states that the 
extreme tail of all distributions of uncorrelated random variables (fitness, in this 


case) can be of three types only. Depending on whether the tail of underlying 
fitness distribution is truncated or decaying faster than a power law or as a 
power law, the EVT distribution would belong to Weibull or Gumbel or Frechet 
domain, respectively [J]. All the three EVT domains can be obtained from the 
generalized Pareto distribution given as 

P(f) = (! + (1) 

where k is the tuning parameter. One example from each of the three EVT 
domains is shown in Fig. [T] which shows the distribution of beneficial effects 
p(f) with fitness /. The three types of EVT domains are classified according 
to the value of k. Here negative k belongs to the Weibull domain, while k = 0 
corresponds to Gumbel domain and positive k to Frechet domain. Interestingly, 
all the three DBFEs have been observed in experiments on microbial populations 
is m is eb Inina iia os. While the exponential distribution belonging 
to the Gumbel domain has been most commonly seen 0010 , in recent 
times, the distribution of beneficial mutations belonging to Weibull [Tf)l[T4] and 
Frechet m domains have also been observed. 
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Fig. 1: The figure shows the distribution of beneficial fitness effects 
p(f) with fitness / for the three EVT domains, given by Equation 

Eq. (Ill) for various n. Here n is the tuning parameter with k > 0, k —> 0 and 
k < 0 corresponding to Frechet, Gumbel and Weibull domains respectively. 
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Recent theoretical studies have shown analytically and numerically that 
qualitatively different patterns occur in the adaptation dynamics of populations 
in different EVT domains of DBFEs in low mutation regime Gang Hang- 
Specifically, it has been shown that the fitness gain in a fixation event follows the 
pattern of diminishing returns in Weibull domain, constant returns in Gumbel 
domain and accelerating returns in Frechet domain, and thus indicates that this 
quantity can be used to predict the DBFE. But these observations are restricted 
to strong selection-weak mutation (SSWM) regime in which the genetic varia¬ 
tion in the population is minimal, that is, only one beneficial mutation is present 
in the population in the time interval between its appearance and fixation [7]. It 
is then natural to ask whether the relationship between the adaptation dynam¬ 
ics and the DBFE mentioned above holds for large populations as well, where 
there might be more than one beneficial mutation competing for dominance in 
the population. The main aim of our study is to address this question and to 
see if the fitness gain in a fixation event can be used for predicting the DBFE 
in a more general scenario. 

Here we are mainly concerned with the populations in which a large number 
of mutants are produced at every generation. Hence, more than one beneficial 
mutation is expected to be present at the same time m nn 123 m ns. in 
this case, the beneficial mutations will compete with each other as has been 
observed in different experimental populations m H3 HE HD- In this high 
mutation regime, as a result of the competition among the beneficial mutations, 
the rate of adaptation slows down. The fitness advantage due to the mutations 
that get fixed is much higher, since the availability of more mutations results in 
allowing only the best (fittest) mutation to get fixed [28]. A clear comparison 
of the population fraction of new mutants appearing in the population for two 
mutation regimes is given in Fig. [2] In Fig. [D(a) we see that the population in 
the SSWM regime is more or less monomorphic with only one mutant present 
at a time in all the three EVT domains. However, in high mutation regime, 
population is polymorphic with more than one mutant produced in it at every 
generation as shown in Fig. [21(b) . In fact, a large amount of genetic variation is 
observed in the case of bounded distributions corresponding to k < 0 in Eq. ID 
resulting in strong competition between the beneficial mutants. 

In this work, we have used Wright-Fisher dynamics to study the adaptation 
dynamics of an asexual population in high and low mutation regimes for the 
three EVT domains of DBFE. The main motivation of this study is to look 
for quantities which can be used to distinguish between the DBFEs using the 
properties of adaptation dynamics as opposed to the direct measurements of 
DBFEs. Our most important and interesting result is concerned with fitness 
difference between mutations that spread in the population which shows quali¬ 
tatively different trends in three EVT domains and thus helps in distinguishing 
the DBFEs. 

We have also studied another quantity which is the rate of change of fitness 
with time, and observed that this shows quantitatively different behaviour for 
different EVT domains of the DBFEs. Though some results for the rate of 
change of fitness are already known in the literature [29], we measured it for all 
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Fig. 2: Population fraction of different mutant classes are shown as 
different coloured lines. Where, (a) shows the SSWM (N/i = 0.1, low mu¬ 
tation rate) regime and (b) shows the high mutation (N/j, = 10) regime for all 
three EVT domains of DBFE. 


the three cases (Weibull, Gumbel and Frechet) and identified that this can be 
used to distinguish the DBFEs in both SSWM and high mutation regimes. To 
obtain a complete picture, a comparison of our study with the existing literature 
is given in Table Q] below. 


Quantities 

DBFE domains: Low mutation regime 

DBFE domains: High mutation regime 


Weibull 

Gumbel 

Frechet 

Weibull 

Gumbel 

Frechet 

A fstep 

F(t) 

m 

m 

IIS 

this study 

this study 

this study 

this study 

EH 

m 

this study 

ESI 

this study 


Table 1: Here, A f step is the average fitness difference between the present leader 
and the new beneficial mutation that gets established and T(t) is the rate of 
change of fitness. 

We also measured quantities like the genetic variation and the number of 
mutations in the most populated sequence. All of these quantities are discussed 
in Results section. We suggest that the distinct trends shown by the above 
mentioned quantities can be used to predict DBFEs from experimental studies 
on adaptation. The relevance of our work to experiments is also explored in 
Discussion section. 


Materials and Methods 

We track the dynamics of a population of self-replicating (asexual), infinitely 
long binary sequences of fixed size using the standard Wright-Fisher process El 
[28] . In our work, the population size is held constant at N = 10 4 , unless specified 
otherwise and the total mutation probability (beneficial and deleterious) per 
sequence is given by Every occupied sequence is counted as a class and 
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labelled when it arises in the population. Initially, the whole population is 
in class 1 whose fitness is fixed and specified in every simulation run. We 
have used the term leader to refer to the class whose normalised probability of 
reproduction (product of population fraction and fitness) is greater than half. 
In that case, clearly, class 1 is the initial leader since the whole population is 
localized there. At every time step, out of N sequences, mt are chosen from 
a binomial distribution with mean N p, as mutants. Every mutant produced 
increases the number of classes in the population by one, and with time, the 
mutants may produce their own set of further mutants. The population fraction 
of each class may grow or go extinct, as can be observed in Fig. [2] At any time 
t, the number of classes present in the population is given by Af c {t ), and the 
population size and fitness of each class, i, where 1 < * < Af c , is denoted by 
n(i,t) and /(*), respectively. The normalized probability of each class at every 
time step, p(i, t) contributing offspring to the population at the next time step, 
depends on the population size of the class at the present time step and the 
fitness of the class as 


p{i,t) = 


n(i,t) f(i ) 


( 2 ) 


Note that though the fitness of the class is the same as long as it persists in the 
population, its size may vary at every time step, thus changing its probability 
of reproduction as given by Eq. ©• The different classes are populated in the 
next time step based on the multinomial distribution 


M c (t) 

P(n(l,t'),n(2,t')..n(^c,t')) = N\ [] 

3 =1 


[ pQ ',*)]" 0 ' 0 

n(j,t)\ 


( 3 ) 


where t' = t + 1. The above equation is subject to the constraint t') = 

N. In our simulations, we implement Eq. © along with the above constraint 
by converting Eq. © to a binomial distribution for every class, 1 < i < J\f c {t) 
as 


n(i, t') 


(m\ 


9 (M)" (i,t) (l-9(M))^ W_n(M) 


( 4 ) 


We set the population size of the last class as n(Af c (t ), t') = 1 

In Eq. ©, 






n(i, t'). 


( 5 ) 


and N{i) = TV - £}"in(j, t). 

At every time step, once the classes are populated based on the algorithm 
described above, m* sequences are chosen as mutants based on the binomial 
distribution with mean Np. Every new mutant class that appears in the popu¬ 
lation reduces the population size of the class in which it arose by one. In our 
work, we have varied [i to access both the SSWM (low mutation) and the high 
mutation regime. In our simulations unless specified otherwise, Np = 0.01 in 
low (SSWM) and Nfi = 50 in high mutation regimes. 
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A new class is assigned to each mutant and its fitness is chosen from a 
generalized Pareto distribution t 4j given in Eq. 0- The advantage of using 
Eq. (HI) is that we can access all three EVT domains of DBFE by changing n. 
The distributions whose k < 0 belong to the Weibull domain, while k = 0 belong 
to the Gumbel domain, and k > 0 belong to the Frechet domain, respectively. 
The frequency distribution of beneficial effects p(f) for various values of k is 
shown in Fig. [l] The upper bound u for the distributions chosen from Eq. 0 
is infinity when k > 0 and equals — 1 /n for k < 0. In this work, the fitness of 
the mutants is independently chosen from Eq. 0 thus making the fitness of the 
mutant, F m an uncorrelated variable, which may be greater or smaller than the 
parent fitness, F p . We have analyzed the results to see how they vary between 
the three EVT domains and different mutation rates. 

In the allocation of the fitness to any mutant, our work differs from the 
other works on clonal interference mm wherein the fitness of the mutant is 
hiked above the parent fitness by the selection coefficients (s) which may be held 
constant or chosen from a distribution as F rn = (1 + s)F p . Unlike the model 
we have used in this work (as explained above), in this case there is a strong 
correlation between the mutant fitness F m and the parent fitness F p . In those 
cases, the mutant fitness is always greater than the parent fitness and on an 
average, a double or higher mutant is fitter than a single mutant. This is in 
contrast with our work since in ours, as the fitness of the parent increases, the 
number of better mutants available decreases thus producing different patterns 
for the fitness increment in each EVT domain. 

In our model, whenever a mutant class goes extinct, the classes below it is 
moved up, and the number of classes in the population is reduced by one. The 
normalized probability of reproduction given in Eq. 0 of a class exceeding half 
corresponds to a leader change. The new leader determined now belongs to the 
class whose normalized probability exceeded half. We have also explored other 
criteria for defining the leader as the most populated class and find that our 
main results are robust with respect to the change in criteria (data not shown). 

Every change of leader is counted as a step. In the high mutation regime 
the population is spread over many sequences and a sequence can produce two 
or more mutants each of which may become leaders at different time steps. 
However, in the SSWM regime, the whole population is localized at a single 
sequence with a fixed fitness and can only move to a different sequence with 
higher fitness one mutation away. Thus every new leader arises from the previous 
leader, as can be observed in Fig. 0a). When a better sequence appearing in the 
population does not get lost due to genetic drift, it quickly gets fixed. Further 
mutations that may lead to future leaders appear in this genetic background. 
The change in the fitness of the population is the same as the change in fitness of 
the leader. In this case, every move of the population (leader) from one sequence 
to another is termed a step in the adaptive walk [3D) SH [321 S3] , whereas in high 
mutation regime, the population is polymorphic as can be seen from Fig. 0b) 
and the leader change is not obvious from the figure. 

Various quantities like the fitness difference between successive leaders and 
the average number of mutations in the leader are averaged only over the walks 
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that take the step. Other quantities like the number of classes present at any 
time point and the rate of change of fitness are averaged over all time steps in 
that simulation run. 

In this paper, the total number of iterations is 10° in every simulation run 
and the dynamics is tracked for finite time limit of 10 4 generations which we 
shall refer to as t ma x- In this time span, the maximum fitness value, f m ax that 
arises in the population can be calculated as 


tmax / p(f)df = 1 


( 6 ) 


where u is the upper limit of the fitness distribution equalling (-1 /k) for bounded 
distributions and infinity for unbounded ones [3]. From the above integral, we 
get 


fmax — 


{tmaxNnY - 1 


(7) 


Results 

The number of classes in the population 

For a population of fixed size, the number of classes in the population is expected 
to increase with the mutation rate. The average genetic variation defined here as 
the average number of classes (A f c ) present in the population is shown in Fig. [3] 
for all the three domains of DBFE. The top and bottom panels of the figure show 
the data corresponding to the high and low mutation regimes respectively. In 
both the mutation regimes, we see that the average number of classes increases 
during the initial time steps and decreases at later times when the classes with 
lower fitness are eliminated by the fitter ones. The maximum number of classes 
existing in the population for the first case, as shown in Fig. [3f a), does not 
belong to the lowest initial fitness, but to a slightly higher initial fitness. This 
could be because when the initial fitness is low, its class is quickly replaced 
by a fitter mutant and all further mutants arise on this new background must 
compete with this fitter class. 

In the low mutation regime, the population for the most time is localized at 
a single sequence and produces Nfj, mutants at every time step. So in this case, 
the average number of classes approach a constant N+ 1 at large times as 
can be seen in the bottom panels of Fig. [3] These panels also indicate that the 
value of this constant increases with decreasing k. This is because in the case of 
bounded distributions with k < 0, the fitness of beneficial mutant produced is 
expected to be closer to the parent fitness. In other words, mutations are nearly 
neutral and thus it takes longer time to take over the population as shown 
in Fig. Eta). This results in a larger number of mutants in Weibull domain 
which can be observed in the bottom panel of Fig. Et a). We can clearly see 
from the top panels of Fig. [3] that number of classes increases with decreasing 
k even in high mutation regime. Also, the average number of classes present 
at a time is much higher in this regime. This makes sense because the fitness 
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Fig. 3: The plot shows the average number of classes in the population 
as function of time for various initial fitnesses. The fitnesses are chosen 
from Eq. m with (a) re = — 1 (b) re — > 0 and (c) re = 1/4. For each re value, 
the plot shows J\f c (t) in both the high mutation (top panels) and low mutation 
(bottom panels) regimes. The straight line in all plots shows N/i+1. 


of the classes belonging to re = —1 cannot be very different from each other 
(can only vary between 0 and 1) which makes it possible for many of them 
to exist in the population. The maximum fitness of the classes belonging to 
re = 1/4 distribution will, on an average be much higher than all others (since 
the distribution is unbounded with a fat tail), thus out-competing the others in 
the population. 

Number of mutations in the leader 

In the low mutation regime, the average number of mutations in the leader 
is expected to be very close to the step number since the genetic variation 
in the population is low and any mutation that escapes drift quickly takes 
over the population [3]. We verify this point via simulations as depicted in 
Fig. |U We find that the mutation number equals the step in all the three 
EVT domains of the DBFE in the low mutation regime for the initial steps. 
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However in the high mutation regime, the number of mutations in the leader 
of any step differs between the three DBFE domains. When the mutation 
rate is increased, the genetic variation of the population and the significance 
of clonal interference also increases. In the high mutation regime, the number 
of mutations in the leader is found to be less than the step number in all the 
three DBFE domains. This is because there is a chance that different mutants 
originating from the same parent class can become the leader of the population 
at different times. This decrease from the step number is the minimum for the 
fat-tailed distributions and maximum for the truncated ones, as shown in Fig. [I] 
This result is consistent with the number of classes present in the population 
as discussed in the previous section. In the Frechet domain, since the clonal 
interference is minimal, mostly a mutant originating from the present leader 
will become the next one. In the Weibull domain, due to the large number of 
classes present in the population, mutants originating from the same class can 
become the leaders at different time points. 


0 1000 Time 



Fig. 4: The main plot shows the number of mutations in the leader 
at any step for various n and mutation rates. The simulation data are 
represented by points while the broken lines connect the data points. The solid 
line shows y = x. In the inset, from a single simulation run, the fitness of the 
whole population as a function of time is shown by broken lines and the fitness 
of the leader whenever the leader changes is shown by symbols. 
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Fitness and fitness difference 


From our simulations, we find that the average fitness of the first mutant fixed 
in the population, fi increases linearly with the initial fitness, fo for all k in 
the low mutation regime and for k ^ 0 in the high mutation regime. So we can 
write 

fi=ai N ^f 0 + bi N ^ ( 8 ) 

where the coefficients and are constants. In the low mutation regime, 
where the population for most times is monomorphic, the adaptive walk model 
has been used to analytically obtain the fitness at the first step, fi as ns] uni 

fi = r df T(f <- /„)/ (9) 

Jfo 

where the transition probability 


T{f <- fo) 


(1-e —^)p(f) 


ru I 

f fo dg (1-e * 


p(g) 


( 10 ) 


In this model, from Eq. ©, the coefficient was obtained as 0.33,1.0 and 1.6 

for n = —1,0, and 1/4 respectively. The corresponding fr|/ VAt<sl ' ) for the afore¬ 
mentioned « were 0.66, 2.0 and 1.89 m ■ In the high mutation regime where the 
adaptive walk model is not applicable, we obtained the values for the coefficients 
in Eq. J5]) numerically. We find that for large /o, equals 0.004 and 1.5 and 
b ( k°' 1 equals 0.99 and 9.1 for k = —1 and 1/4 respectively. 

The interesting result from our work is that, irrespective of the number of 
mutants produced in the population, the difference Af step = /i — /o between 
the fitness of the first step and the initial fitness displays different qualitative 
trends: increases for positive k, approaches a constant when k = 0 and decreases 
for negative k , as shown in Fig. [3 and [SI Fig.| 

We can better understand these increasing and decreasing trends by the 
following heuristic argument. In both the low and high mutation regimes, for 
large fo , the fitness at the first step fi increases linearly with the initial fitness 
as given in Eq. <[8|) and so, we can write the selection coefficient defined as the 
relative fitness difference, at the first step as 


/i — fo 
fo 


(qf M) - l)/p 
fo 



for all k , N/j, 


( 11 ) 


In an adapting population, since the fitness of the first step is greater than 
the initial fitness, the selection coefficient is always positive. As the fitness 
distributions belonging to the Frechet domain are unbounded with fat tails, 
high f 0 values can be considered in which case, the second term on the right 
hand side (RHS) of Eq. (fill) can be ignored and we can write s « ( a— 1) >0. 
Thus for k > 0, > 1 and therefore it follows that the fitness difference 
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(a) (b) 



Fig. 5: The main plot shows the fitness difference at the first step as 
a function of the initial fitness for various N fj,. The fitnesses are chosen 
from Eq. 0 with (a) re = —1 (b) re —0 and (c) re = 1/4. The solid lines in the 
main plot are obtained by numerically evaluating the integral given by Eq. ([9]). 
while the dotted lines are the approximate results that can be obtained for the 
results when the initial fitness is high, in the low mutation regime. The broken 
lines for re 7 ^ 0 are lines of best fit as mentioned in the text. The broken line 
for re — > 0 is used for connecting the data points. The inset shows the fitness 
difference at the first step as a comparative measure of the fitness difference 
obtained at the first step when / 0 = 0. Here, the lines are used for connecting 
the data points. 


at the first step increases with f 0 . On the other hand, since the distribution 
belonging to the Weibull domain are truncated, we can invoke the following 
inequality to explain the decrease in fitness difference with increasing f 0 : 

fi ~ So < u — fo, (12) 

where u is the upper limit of the fitness distribution. With increasing / 0 , the 
RHS of the above equation decreases which shows that as the initial fitness in¬ 
creases, fi — fo has to necessarily decrease. Thus the qualitative trends discussed 
above appear to be determined by the behaviour of the tail (bounded/unbounded), 
and not by the details of the model. 
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Also, it is interesting to note that while the data points for the exponentially 
decaying distribution (k = 0 ) increase and seem to be approaching a constant 
in the low mutation regime, the data in the high mutation regime seems to 
be reducing to approach the same constant. Our simulation results shown in 
Fig. [5] not only match the predicted theoretical values and validate the claim of 
different qualitative trends in each EVT domain in the SSWM regime but also 
show that the trends hold irrespective of the number of mutants produced in the 
population. This result suggests that the qualitatively different trends of the 
fitness difference (increasing, constant and decreasing with initial fitness in the 
Frechet, Gumbel and Weibull domain, respectively) can be used to distinguish 
between the EVT domains in a more general scenario. 

Though the fitness difference at the first step is greater in the high mutation 
regime, when compared with the results in the low mutation regime, when 
we look at the fitness difference at the first step scaled by the fitness difference 
obtained when the initial fitness is zero (insets of Fig. [5]), we see that this increase 
is slower in the high mutation regime compared to the results obtained in the 
low mutation regime. This indicates that as the mutation rate increases, though 
the number of mutants accessed is higher, the difference in fitness compared to 
a lower initial fitness is not proportionally higher and is in fact lower for all the 
fitness distributions. 


Rate of change of fitness with time 

Besides the fitness increment at a fixed event of leader change, we also measured 
the fitness as a function of time as shown in Fig. [G] We observed that even 
though the fitness increases with time in all the three EVT domains, the rate 
at which the fitness increases depends strongly on the DBFE. This rate has an 
initial fast transient phase, after which it slows down. 

The initial transient phase is strongly dependent on the initial condition 
as well as the mutation rate as shown in [52 Fig. | The increase in fitness is 
fastest for the lowest initial condition, but it approaches the same fitness value 
as in the case of higher initial fitness in few generations. The time taken for 
populations of different initial fitness to reach the same fitness value depends 
on the mutation rate: for Nfi 1 , it takes about 20 generations, whereas for 
Nfj, -C 1, it is approximately 200 generations. Even after this transient phase, 
the rate of increase in average fitness (.F(t)) with time depends on the mutation 
rate as shown in Fig. [G] This is because of the fact that, when a large number 
of mutations are available at the same time, a highly fit mutant can invade 
the population and give a large fitness increment. So the fitness of a highly fit 
mutant sequence would be greater in the high mutation regime compared to 
the one in low mutation regime. The maximum fitness value reached in 9000 
generations, in the case of Frechet distribution is about 10 times more for high 
mutation regime, which is consistent with the expectation from Eq. 0- Even 
beyond this point we noticed that the fitness is still increasing. In the same 
way, Gumbel distribution also shows a significant increase in maximum fitness 
reached in high mutation regime compared to the SSWM regime (about 4 times). 
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Fig. 6: Figure shows the average fitness increase with time for three 
different values of k in the SSWM regime (Nn = 0.01), and in high 
mutation regim e(N/i = 50). In all the cases, the population starts with the 
same initial fitness /o = 0.5. 


Here also we found that the fitness is still increasing beyond the time point till 
which we tracked the dynamics. The bounded distribution (Weibull) reaches 
near the upper bound in SSWM and evolves slowly. But the fitness reaches a 
fitness plateau in high mutation regime and rate of adaptation becomes zero as 
can be seen in Fig. [6] 

From this we observe that the rate of change of fitness depends strongly 
on the properties of the underlying DBFE, which suggests that looking at this 
quantity can help us in distinguishing the DBFEs. So we measured the fitness 
increment defined as 

A P{t) = {T(t + 1) - T(t)) (13) 

at each step. The A T(t) initially increases, then slowly decreases and settles 
down to a zero as shown in Fig. [7] If we denote this function as 

a m = £ (i4) 

where A is a constant and the exponent a can be used to distinguish the DBFE, 
since, as explained below, exponent a is found to be greater (smaller) than one 
in Weibull (Frechet) domain, but is close to one in Gumbel domain. 

In the SSWM regime, from Fig. [TJa), we can see that each type of DBFE 
considered shows a different rate of decay. Weibull domain has a faster decay 
with a = 1.86, Gumbel domain has a « 1 [55] and Frechet domain a = 0.66 
[55] , We observed that the same trend holds in high mutation rate regime as 
well, where a values are slightly larger in all cases. In this regime also a = 2.02, 
1 and 0.76 for Weibull, Gumbel and Frechet domains respectively as shown in 
Fig. [T](b). In the high mutation regime, in the case of Weibull distributions 
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Fig. 7: Figure shows the fitness increment in each time step for three 
different values of n in two mutation regimes (SSWM and high muta¬ 
tion). In each case the data is fitted with the theoretically expected function 
given in Eq. da. except for exponential distribution for which we used the 
theoretical prediction by Park and Krug [29] . In all the cases, the population 
starts with the same initial fitness /o = 0.5. 


fitness reaches a plateau in few generations, after which its rate of change goes 
to zero, as can be observed in Fig. [7[b). The theoretical prediction for the 
fitness at every time step for the unbounded distributions belonging to the 
Gumbel and Frechet domains was obtained by Park and Krug [29] in the low 
mutation regime. The comparison of our simulation data with these predictions 
shows a very good agreement in Gumbel domain and in Frechet domain (up 
to a constant). In this work, we have considered the bounded distribution also 
and observed that its rate of decrease is faster with an exponent greater than 
one, which was not considered in the previous studies. We observed that even 
in high mutation regime, the exponent a shows the same behaviour. In this 
regime the rate of change of fitness has been calculated only for exponential 
distribution belonging to the Gumbel domain [29] and their prediction matches 
with our data. In this work, we have obtained a complete picture by studying 
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the rate of change of fitness numerically for the other two EVT domains as well. 

Thus, the second main finding from our study is that in all DBFEs, the 
fitness difference at each time step decreases with time as given by Eq. (Hi and 
we can distinguish between the three EVT domains of DBFEs by looking at the 
exponent a. A comparison of our results with the existing literature is given in 
Table [T] 

Discussion 

The main purpose of our work is to determine the quantities which can be used to 
distinguish the different extreme value domains of the DBFE. Previous studies 
Ha HI] have found that in an adapting population, the fitness gain at each 
fixation event shows qualitatively different trends in the three DBFE domain, 
when the number of mutants produced in the population is much less than one at 
every generation (Nfj, C 1). The focus of this work is to explore the parameter 
regime in which the number of mutants produced is much above one (N/j, 1). 

When the mutation rate is high, the population becomes polymorphic and the 
better mutants existing in the population compete with each other. From our 
study we have observed that the qualitative trends found for fitness difference 
when a new mutation establishes in the low mutation regime hold irrespective 
of the number of mutants produced. Thus this study suggests that fitness 
difference between the successive mutations that spreads in the population is a 
very important and robust quantity that can be used to predict the DBFEs in 
a more general scenario. 

From our simulations, we see that as the initial fitness is increased the fit¬ 
ness difference at the first step given by A f s tep reduces, approaches a constant 
or increases with initial fitness in the Weibull, Gumbel and Frechet domains, 
respectively. We can understand these trends by a heuristic reasoning as dis¬ 
cussed in detail in the Results section. This argument explains the increase in 
A fstep with fo for unbounded power law distribution and shows that the trends 
are determined by the behaviour of the tail (bounded/unbounded), and not by 
the details of the model. 

Another important measure in understanding the dynamics of adaptation 
is the rate at which it occurs. Most of the previous studies which measured 
the adaptation rate have only considered exponentially distributed fitness dis¬ 
tributions [20 [2TJ 22l [28l [34] . A previous study by Park and Krug [29] also 
considered DBFEs belonging to Frechet domain but only in the SSWM regime 
(see Table [T]). In this work, we have extended the previous studies by numeri¬ 
cally measuring the rate of change of fitness for bounded distributions also. We 
have measured the rate of change of fitness in all the three EVT domains of 
the DBFE in both low and high mutation regimes. We observed that in all the 
cases, the rate of change of fitness decreases with time as ~ t - “, where a > 1 
for Weibull, a s=s 1 for Gumbel [29] and a < 1 for Frechet domains [29] . 

Experimentally, the distribution of beneficial fitness effects can be inferred 
by two methods. In the first method, mutations are introduced in the wild 
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type sequence and those that confer a fitness advantage are separated and their 
distribution of fitness effects are determined. By this method, DBFE belonging 
to all the EVT domains have been observed ummmmmmmmm- 
In contrast, here we focus on learning about DBFE via adaptation dynamics. 
Though many works have tracked the dynamics of the population during adap¬ 
tation m na mi 133 ng, in most of them only the selection coefficient of the 
mutant fixed was measured. In our study, we have observed that the selection 
coefficient as given by Eq. m always decreases, with the increasing initial fit¬ 
ness or increasing steps as shown in |S3 Fig.| Hence this quantity is not useful 
to distinguish between the EVT domains. However, from our study we observe 
that the fitness difference between steps shows different patterns depending on 
the EVT domain of the DBFEs in both the high and low mutation regimes and 
can be used to distinguish between the EVT domains. 

In this work, we have numerically shown that the fitness returns in each EVT 
domain is very robust and holds even when the number of mutations produced 
is large (Nfi 1). Fitness difference can be measured in experiments, for 

example as in [5]. We suggest that experiments can predict the EVT domain of 
DBFE by measuring the fitness difference between successive mutations fixed in 
the population, or even from the fitness of the first mutation, when the initial 
fitness is varied. However currently experimental studies that measure both 
fitness and DBFE in the same study are not available but it is highly desirable 
to have such studies to test our predictions. 
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Figure SI Fig.: The plot shows the fitness difference at the first step 
as a function of the initial fitness for different k and two different Nfi. 

The lines give the theoretical values while the open symbols are the simulation 
output for Nn = 0.02 and the closed symbols are those for Nn = 5. 
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Figure S2 Fig.: The figure shows the average fitness of the population 
for various k in both the low and high mutation regimes. Two different 
initial conditions fo = 0 (open symbols) and /o = 0.5 (closed symbols) are 
considered. 


21 










Figure S3 Fig.: The main figure shows the selection coefficient as a 
function of step for all three re values. We considered two different N/j, 
where open symbols and closed symbols are for N/j, = 0.01 and Nfj, = 50, 
respectively. The inset shows the selection coefficient of various steps for two 
different the initial fitnesses / 0 = 0.2 f max and f 0 = 0.6 fmax, where f max is 
calculated using dTJ) in the high mutation regime. 
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